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(54) Decoder and method of decoding usin pseudo two pass decoding and one pass encoding 



(57) A method .of processing video frame data, in- 
cluding the steps of: 

(a) receiving a video frame; 

(b) partially decoding the video frame; 

(c) fully decoding the video frame to produce mac- 
roblocks; 



(d) detemnining video data parameters from the par- 
tially decoded video frame or both the partially and 
fully decoded video frame; 

(e) encoding the macroblocks based on the deter- 
mined video data parameters to provide a com- 
pressed video frame for subsequent display. 
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Description 

FIELD OF THE INVENTION 

5 [0001] The present invention relates to a decoder and a method and system of video data decoding, and in particular 
the decoding of MPEG video bitstreanns, that provides memory savings in frame buffers. 

BACKGROUND OF THE INVENTION 

10 [0002] There is a continuous need for reducing the memory requirements of a video decoder in order to reduce costs. 
However, it is also expected that the subjective quality of the decoded video will not deteriorate as a result. Memory 
reductions (for example In the ratio of 1 0:1 or higher) will also enable the effective embedding of memory components 
within a hardware decoder system component, simitar to the embedded dynamic random access memory ("embedded 
DRAM") technology 

15 [0003] For example, a typical video decoder chip like an MPEG-2 decoder uses a significant amount of memory for 
storing the data in frame buffers to be used for decoding temporally linked video frames, video post-processing and 
for storing on-screen display information for feature enhancements. The reduction of memory requirements, especially 
in relation to video decoding and post-processing, has been the subject of much research since it can provide significant 
savings in manufacturing costs. 

20 [0004] Memory reduction with decimation in the spatial domain causes blurring of the image while decimation in the 
frequency domain by applying a fixed bit rate to encode a macroblock or block, as suggested In the prior art, causes 
unpredictable artefacts which arc especially apparent in fast moving video sequences. 

[0005] Another challenge of recompression is the precision of bit rate control. Since the size of the physical memory 
to be used in a system may be fixed, the rate control of the variable-length encoding circuit (or entropy encoding) must 
25 be accurate, such that the maximum memory is utilized without exceeding the allocated memory size. Known methods 
such as virtual-buffer-fullness control may not be used Independently since the variations in the generation of bits 
would not be ideal for a fixed and maximally utilised memory buffer. A tighter control of accuracy for the virtual-buffer- 
fullness method results in the degradation of picture quality, while better picture quality is associated with a high variation 
in the bit rate. 

30 

SUMMARY OF THE INVENTION 

[0006] The present invention provides a method of processing video frame data, including the steps of: 

35 (a) receiving a video frame; 

(b) partially decoding the video frame; 

(c) fully decoding the video frame to produce macroblocks; 

(d) determining video data parameters from the partially decoded video frame or both the partially and fully decoded 
video frame; 

40 (e) encoding the macroblocks based on the detemnined video data parameters to provide a compressed video 
frame for subsequent display 

[0007] In another aspect, the invention provides a video decoder adapted to perform the above method. 
[0008] The present invention further provides a video decoder including: 

45 

(a) a bitstream parser for receiving a video frame; 

(b) an embedded decoder for partially decoding the video frame and fully decoding the video frame to produce 
macroblocks; 

(c) a data analyzer for detemnining video data parameters from the partially decoded video frame or both the 
so partially and fully decoded video frame; 

(d) an embedded encoder for encoding the macroblocks based on the detemnined video data. 

[0009] To achieve a significant compression ratio while rendering acceptable picture quality and minimizing imple- 
mentation complexity, the present invention provides a method of applying pseudo two-pass decoding and one-pass 
55 encoding of input video data. In the first pass of the two-pass decoding process, the Input video bitstream is decoded 
partially to extract useful picture statistics for use in the subsequent one pass encoding process. The second pass of 
the two-pass decoding process is performed in parallel with the one-pass encoding process, which allows the storage 
of the decoded picture within a target-reduced amount of memory. 
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[001 0] The preferred embodiment re-encodes each anchor frame (which may contain either an l-Plcture or P-Picture) 
as an l-Plcture to the desired memory compression ratio with minimum trade-off in picture quality and system com- 
plexity. Also, for cases where the display resolution is less than the bitstream resolution, a non-anchor picture (i.e. a 
B-PIcture) can be decoded on-the-fly using a two-pass decoding technique to significantly reduce the overall memory 

5 requirements of the system. 

[0011] In the case of a standard MPEG decoder, the one-pass encoding process takes the techniques of standard 
Intra macroblock encoding with Discrete Cosine Transfonn ("DCT"), quantization, the scanning of DCT coefficients in 
a zig-zag pattern, and Variable-Length Coding ("VLC"). To minimise picture degradation and maximise the useful picture 
statistics that can be extracted during the first pass of the two-pass decoding process in the present invention for all 

10 frames, the one-pass encoding process used in a preferred embodiment utilises encoding techniques similar to that 
described for a standard MPEG encoder. 

[001 2] Embodiments of the invention also relate to a system for pert onning two-tiered rate control that re-compresses 
video data using present compression techniques. Based on the picture statistics derived from the first pass of the 
two-pass decoding process, the two-tiered rate control scheme is applied to the one-pass encoding process to deter- 

15 mine the quantizer scale for efficient bit allocation. The detenninatlon of a suitable quantizer scale enables effective 
compression while maintaining good picture quality. Artefacts caused by the dropping of DCT coeffk:ients during quan- 
tization are significantly reduced since decimation In the frequency domain Is mostly perfonned within the ideal com- 
pression limit of the system, but rarely within the expansion range of the quantization process. Typically, the compres- 
sion scheme supports a 10:1 memory reduction per frame buffer. 

20 [0013] In addition, the rate control scheme is able to stabilise the bit rate generation of the compression scheme and 
maintain variations in the bit rate generally within 10% of the average bit rate. This rate control accuracy makes the 
present system suitable for use In systems with fixed memory buffers. Hence a video decoder with an embedded 
memory system may be built in accordance with the principles of the present invention. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

[0014] Preferred embodiments of the present invention are hereinafter described, by way of example only, with ref- 
erence to the accompanying drawings, wherein: 

30 FIG. 1 is an illustration of the decoding of an interpolated macroblock in the B-Plcture; 

FIG. 2 Is a flow diagram illustrating the interaction of various modules employed in a compression scheme of an 
embodiment of the present invention; 

FIG. 3 is an example of an implementation architecture of a reduced memory video decoder of an embodiment of 

the present invention, 

35 FIG. 4 is a timing diagram illustrating an example of how the present invention processes a MPEG-2 video bitstream 

with a frame encoding sequence of {I, P, B, ...}; 

FIG. 5A is a graphical representation of the normal case of maximal overlap of the predicted macroblock on the 
macroblock grid of compressed frame; 

FIG. 58 is a graphical representation of the boundary case of equal overlap on 4 macroblocks; 
40 FIG. 5C is a graphical representation of the boundary case of equal overlap on 2 of the 4 macroblocks; 

FIG. 6 is a block diagram of a rate control circuit according to an embodiment of the invention; 
FIG. 7 is a block diagram of the embedded encoder module shown in Figure 3; 

FIG. 8 Is a block diagram of a decoder of another embodiment of the present invention that supports lower resolution 
picture or zoom-out picture decoding. 

45 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS OF THE INVENTION 

[0015] Embodiments of the present invention are applicable to an MPEG-2 video decoder. The MPEG-2 specifica- 
tions (also referred to as "ISO/IEC 13818") achieves significant compression by removing temporal redundancy be- 

50 tween frames close in time. This is done in addition to removing spatial and statistical redundancy within a frame by 
DCT or entropy encoding and lossy compression by quantization. The temporal element implies that the encoding of 
a frame is not limited to information within the frame itself but information that may span across several pictures. In 
MPEG-2, the tenri "picture" refers to either a frame or a field. Therefore, a coded representation of a picture may be 
reconstructed Into a frame or a field. 

55 [0016] Figure 1 illustrates an example of MPEG decoding of a macroblock In a B-Picture 202, which In this example 
requires forward prediction from an l-Picture 201 and backward prediction from a P-Picture 203. The sequence of 
frames 204 reflects the order in which the frames will be displayed, starting from the l-Picture 201 . An l-Picture 201 
(short for "intra picture") is encoded purely with infonmation within the picture. A P-Pteture 203 (short for "predicted 
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picture") is encoded wllh information from an earlier l-Plcture 201 , or from an earlier P-Picture, in addition to information 
representing the current frame. A B-Plcture 202 (short for "bidirectional picture") Is encoded with infomiatlon from both 
or either of an l-Picture 201 and P-Plcture 203 to be displayed earlier and later than the cun-ent B-Picture 202. Even 
with "B-on-the-Fly" decoding, which involves the direct decoding of B-Picture bitstreams for display without intemiediate 
5 storage, at least two anchor frames (i.e. an (-Picture or P-Picture) are required. Each anchor frame has a maximum 
size of approximately S megabits (derived by: 720 horizontal pixels x 576 lines x 1 .5 bytes per pixel x 8 bits per byte 
» 5 megabits) in the case of a PAL picture of D1 resolution. 

[0017] It is desired to re-encode each anchor frame (I.e. an l-Picture or P-Plcture) as an l-Plcture to the desired 
memory compression ratio with minimum trade-off in picture quality and system complexity. 

10 [001 B] Figure 2 shows the general operation of the memory reduction scheme in the decoder of a preferred embod- 
iment. The MPEG video bitstream is partially decoded by a BItstream Parser circuit 301 , which applies variable-length 
decoding on the video bitstream to obtain the quantized OCT coefficients and macroblock infonnation for all macrob- 
locks and related picture Infomnation in each frame being processed. This is the first pass of the two-pass decoding 
process. The Information from the partially decoded MPEG video bitstream Is passed to a Data Analyzer circuit 302, 

15 which further extracts picture statistics and macroblock information in relation to each frame being processed. The 
Bitstream Parser 301 retrieves macroblock and picture information from the pictures stored in compressed frame buffers 
and the Data Analyzer circuit 302 uses this further macroblock Infomriation and picture statistics as described later. 
[0019] An MPEG-2 Decoder circuit 303 completely decodes the MPEG video bitstream to produce macroblocks, 
which are later re-encoded by an Embedded Encoder 308 for storage in the compressed frame buffers and referenced 

20 by an address. The stored macroblocks may be later retrieved from memory for the motion compensation of predicted 
macroblocks, which may be decoded from later frames, thus forming the final pixel values in preparation for display. 
At the same time, macroblock Information Is passed to the Data Analyzer 302 as It Is generated by the MPEG-2 Decoder 
303. The Data Analyzer 302 uses more infomnation from decoding (not shown) of the compressed frame buffers and 
computes Important macroblock parameters (described befow). 

25 [0020] A rate control circuit 304 then uses the earlier computed picture statistics and macroblock parameters from 
the Data Analyzer circuit 302 to derive a suitable quantizer scale for the Embedded Encoder 308. A Macroblock Bit 
Allocation circuit 305 first allocates the target macroblock bits based on the scaled macroblock complexity. The bit 
allocation process allocates the number of bits to be used for encoding the AC coefficients of each macroblock using 
scaled macroblock complexity together with adjustments from a proportional and integral error feedback controller. 

30 [0021] The equation governing the bit allocation process is given as follows: 



35 



* pic 



where 

40 is the target number of bits for encoding AC coefficients for the macroblock; 

;. is the estimated complexity for the ^ macroblock; 
ip^ is the estimated complexity from current picture statistics; 
s is the target bits for encoding all AC coefficients from current picture statistics; 
^^\s the number of bits for encoding the error of the AC coefficients for the (/-O* macroblock; 



Off Is the accumulation of the number of bits for encoding the error of the AC coefficients from the 0*^ macroblock 
50 up to the {h 1)^ macroblock; 

y\ is the local proportional error feedback control constant; and 
Y is the integral error feedback control constant. 



55 



[0022] For the present system the macroblock complexity, x >is generally defined as: 
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r 

where 

s is the number of bits used for encoding AC coefficients of a macroblock; and 
q is the quantizer scale used for the macroblock. 

5 

[0023] The process of scaled macroblock complexity bit allocation is developed on the finding that the macroblock 
complexity is relatively constant over a range of quantizer scales. A smaller quantizer scale generates more bits for 
encoding the AC coefficients of the same macroblock while a macroblock with higher complexity requires more bits 
for encoding its AC coefficients for the same quantizer scale. 

10 [0024] The decoder differentiates between the number of bits generated by the AC and DC coefficients. As DC 
coefficients affect the contrast of the picture, it is critical that the recompression technique used in the Embedded 
Encoder 308 uses the same intra-DC precision and prevents any further loss of infomiation. This differentiation also 
benefits the second process In the rate control scheme, known as the quantizer scale prediction. 
[0025] A Quantizer Scale Prediction circuit 306 then predicts the corresponding quantizer scale to be used in en- 

is coding the AC coefficients of each macroblock using an inverse complexity relation with adjustments made by mismatch 
control. The equation governing the quantizer scale detemnination is given as follows: 



where 

25 ^ is the predicted quantizer scale for the macroblock; 

X estimated complexity of the macroblock; 

Is the target number of bits used for encoding AC coefficients for the ^ macroblock; 
e/.1 Is the number of bits for encoding the error of the AC coefficients for the {i-lf^ macroblock; 

I 



is the accumulation of the number of bits for encoding the error of the AC coefficients from the 0* macroblock 
35 up to (i-ff^ macroblock; 

o is the local proportional en^or feedback control constant; and 
P is the integral error feedback control constant. 

[0026] In addition, normalization may be applied to the macroblock complexity, where the nonmalised macroblock 
^0 complexity, Xh 's given by: 



45 



where; is the average macroblock complexity of the previous re-encoded picture. Nomrialization is used to achieve 
better subjective quality and higher rate control accuracy. Nomnalized complexity reduces the differences between 
50 macroblock complexities, giving a higher rate control accuracy. In addition, high complexity regions are quantized 
coarser without subjective compromise, translating into bit savings for sensitive low complexity regions that are quan- 
tized finer, and thus achieving a better subjective quality. Quantization comparison is made between the normalized 
and non-nomnalized macroblock complexity. 

[0027] The rate control accuracy is guaranteed by a Feedback Control circuit 307 that implements a two-tier propor- 
55 tional integral ("PI") control loop (illustrated in Figure 6). The inner loop tightly controls the quantizer scale prediction 
accuracy at the macroblock level, while the outer loop compensates for the offset created by the inner loop and con- 
verges the encoding bit count to the target picture bit count. 

[0028] With the derived quantizer scale from the Rate Control circuit 304 and the encoding parameters from the Data 
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Analyzer 302, the Embedded Encoder 308 encodes the stream of decoded pixel values from the MPEG Decoder 303 
via the various video compression techniques in the MPEG-2 specifications, namely DCT, quantization and variable- 
length encoding, to fonm a video bitstream for storage In the compressed frame buffers. 

[0029) Preferably, the Rate Control circuit 304 works in real-time, such that the Feedback Control circuit 307 receives 
5 the actual bit count of the video stream as it is leaving the Embedded Encoder 308 and makes any adjustments using 
the Macroblock Bit Allocation circuit 305 or Quantizer Scale Prediction circuit 306 for the next macroblock in the pipeline. 
[0030] Figure 3 shows an embodiment of the decoder, illustrating an example of an Implementation architecture of 
B-on-the-Fly IVIPEG video stream decoding working In conjunction with a system providing reduced memory MPEG 
video stream decoding. To illustrate how the system operates. It is assumed that the system will process a MPEG-2 
10 video bitstream consisting of frames in the display sequence {I, B, P, ...},withacorrespondingframeencodingsequence 
of {I, P, B, ...}. The video bitstream Is received and stored In a Bitstream Buffer 401 . After the required Video Buffering 
Verifier ("VBV") delay, as defined in the MPEG-2 specifications, the bitstream for the entire picture is stored in the 
Bitstream Buffer 401 and ready for decoding. 

[0031] The decoding of each picture Is performed by a pseudo two-pass decoding and one-pass encoding process, 
15 where the first phase of the two-pass decoding process, represented by a first field time of a frame to be processed, 
Involves extracting macroblock and picture infomnation and computing picture statistics (described below) for the cur- 
rent frame. The second phase of the two-pass decoding process (represented by a second field time of the frame to 
be processed) involves extracting macroblock information for the current frame and also the complete decoding of the 
current frame. The second phase is perfomned in parallel with the one-pass encoding forthe same frame. The encoded 
20 video data is then stored In compressed frame buffers 407. 

[0032] In a first field time of the first frame, t^, a Bitstream Parser 402 decodes the first l-Picture sufficiently for a 
Data Analyzer 403 to perfonn picture statistics computations. In a second field time of the first frame, tg, a MPEG 
Decoder 404 decodes the first l-Picture completely and feeds forward infomnation to the Data Analyzer 403 which 
computes macroblock characteristics. In the same time period, a Rate Control circuit 405 uses the computed param- 
25 eters to detemnine a suitable quantizer scale. An Embedded Encoder 406 encodes the original l-Picture as a new 
l-Plcture using the parameters from both the Data Analyzer 403 and the Rate Control circuit 405 at a macroblock 
latency relative to the MPEG Decoder 404. The final video bitstreams are stored in the Compressed Frame Buffers 
407 and the address location of each macroblock in the Compressed Frame Buffer 407 is mapped into a Macroblock 
Pointer Table. 

30 [0033] In a first field time of the second frame, tg, the Bitstream Parser 402 decodes the P-Picture in the second 
frame sufficiently for the Data Analyzer 403 to perform picture statistics computations The decoding of the P-Picture 
is similar to l-Plctures except that the motion vectors are used to locate predicted macroblock properties, thereby 
providing a good estimate forthe cun^ent inter-coded macroblock properties. Reference herein shall be made to a top- 
field first video image sequence. It is assumed that the processing of all video images begins from line 0, corresponding 

35 to the first line of an image. At the same time a Display Decoder 408 decodes the top field (or only the even-numbered 
lines) of the l-Picture retrieved from the Compressed Frame Buffers 407 and passes the decoded picture to the Standard 
Display 409 for further processing before it is displayed. In a second field time of the second frame, t4, the operations 
of the P-Plcture are similar to that forthe l-Picture (i.e. it is encoded by the Embedded Encoder 406 as a new l-Plcture 
and then stored in the Compressed Frame Buffers 407), except that the motion vectors of the P-Picture as decoded 

40 by the MPEG Decoder 404 are used to locate the corresponding predicted macroblocks in the Compressed Frame 
Buffers 407 using the Macroblock Pointer Table. The located reference macroblocks are retrieved and decoded by an 
Embedded Decoder 410, which operates concurrently with the MPEG Decoder 404 to produce the predicted pixel 
values for the motion compensated picture. In the case of l-Pictures, which has concealed motion vectors, a similar 
procedure tike that for P-Pictures Is followed. At the same time, the Display Decoder 408 decodes the bottom field (or 

45 only the odd-numbered lines) of the l-Picture from the Compressed Frame Buffers 407 and passes the decoded picture 
to a Standard Display 409 for further processing before it is displayed. 

[0034] In a first field time of the third frame, tg, the MPEG Decoder 404 decodes the video bitstream of the B-Picture 

in the Bitstream Buffer 401. However in the case of a B-frame picture, only macroblocks of selected motion vectors 
representative of the top field (i.e. only the even-numbered lines) are decoded. The motion compensated top field 
50 pixels are transmitted to the Standard Display 408 for further processing before the top field is displayed. A similar 
operation is perfomned to the bottom field (i.e. only the odd-numbered lines) during a second field time of the third 
frame, tg. This method of direct decoding B-Picture bitstreams for display without requiring intennnediate storage is 
known as "B-on-the-Fly". 

[0035] The present invention only requires two anchor frames at any one time. Where a further anchor frame appears 
55 in the Group Of Pictures ("GOP") sequence, the new anchor frame Is encoded as a new l-Piclure and stored in the 
Compressed Frame Buffers 407 by replacing the earlier of the encoded l-Picture or P-Picture already stored In the 
Compressed Frame Buffers 407. 

[0036] Figure 4 is a timing diagram illustrating an example of how the present invention processes a MPEG-2 video 
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bitstream with a frame encoding sequence of {I, P, B, ...). The frame encoding sequence in this example also reflects 
the order in which the frames will be decoded, t^ 901 Is the first field time of the first frame, and represents the time 
interval in which the l-Picture in the first frame is decoded for picture statistics. X2 902 is the second field time of the 
first frame, and represents the time interval in which the l-Picture in the first frame Is fully decoded and re-encoded as 

5 a new l-Plcture before it is stored in the Compressed Frame Buffers. 

[0037] tg 903 is the first field time of the second frame, and represents the time interval in which the P-Plcture in the 
second frame is decoded for picture statistics. During the interval t3, the top field of the l-Picture from the first frame, 
Hj, is retrieved from the Compressed Frame Buffers and decoded by the Display Decoder in preparation for display. 
t4 904 is the second time field of the second frame, and represents the time interval in which the P-Plcture in the second 

10 frame is fully decoded and re-encoded as a new l-Plcture before it is stored in the Compressed Frame Buffers. During 
the Interval t4, the bottom field of the l-Plcture from the first frame, 11 is retrieved from the Compressed Frame Buffers 
and decoded by the Display Decoder in preparation for display. 

[0038] ts 905 is the first field time of the third frame, and represents the time interval in which the top field of the 
B-Picture In the third frame Is decoded by the Embedded Decoder (by retrieving from the Compressed Frame Buffers 
IS only those macroblocks representing the top field of the current B-Picture, B3^, as defined by the motion vectors In the 
current B-Frame) and then decoded by the Display Decoder for immediate display, tg is the second time field of the 
third frame, and perfomns the same functionality described for tg but is applied in respect of the bottom field of the 
B-Picture in the third frame, B3,,. 

[0039] During the first field time of an l-Picture or a P-Plcture, the Bitstream Parser 402 variable-length decodes the 
^ video bitstreams after removing the numerous headers (including those headers that define a sequence, GOP, picture, 
slice, or macroblock) and extracts the following picture parameters for storage and macroblock parameters for further 
processing In Data Analyzer 403. 
[0040] Picture parameters: 

25 • q_scale_type, the type of quantization table used (linear or non linear); 

• Intra Quantizer Matrix, the two dimensional 8x8 quantization table used for intra-coding; 

• intra_DC_precision, the number of bits used for coding DC coefficients; 

• altemate_scan, the type of zig-zag scan to perfomn; and 

• intra^vlcjormat, the type of variable length coding table used for intra-coding. 

30 

[0041 ] Macroblock parameters: 

• $^ , the bit count of AC coefficients; 

• a the bit count of DC coefficients; 
35 • f/, , the quantizer scale; 

• the set of K full scale motion vectors decoded from the associated motion vector information; and 

• mbjntra, the boolean representation of intra-coded macroblocks. 

[0042] The parameters ci_scatejype, intraJDCjprecision, altemate^scan, intra_vtcJormat and mbjntra are de- 

40 fined in the MPEG-2 Specifications. 

[0043] Preferably, the macroblock parameters are processed in the Data Analyzer 403 as infomnation is extracted 
from the Bitstream Parser 402 during the first pass decoding and the MPEG Decoder 404 during the second pass 
decoding. The Bitstream Parser 402 or the MPEG Decoder 404 might not give the exact list of parameters desired 
since some of the parameters have to be computed outside the MPEG Decoder 404. For example, the bit count of AC 

45 coefficients can be computed from the difference between the bitstream pointer from the first AC coefficient decoding 
to, and not inclusive of, the end of block code. In addition, some of the macroblock parameters are computed from the 
extracted parameters and some are accumulated to form picture parameters. 

[0044] If the macroblock is intra-coded (i.e. if md_/nfra= 1), the estimated macroblock complexity,!., and the estimated 
bit count of DC coefficient, will be defined as follows: 

50 

[0045] If the macroblock is inter-coded or if the prediction en^or is coded (i.e. if mbjntra = 0), the estimated macroblock 
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complexity^,, and the estimated DC coefficients bit count, s.. will be defined as follows: 

5 

10 where , and are the bit count of AC coefficients, the quantizer scale and the bit count of DC coefficients 
respectively, as derived from the Compressed Frame Buffer 407. Statistical data are gathered from the Compressed 
Frame Buffers 407 rather than from the Bitstream Buffer 401 for inter-coded macroblocks, since the predicted mac- 
roblocks provide a closer image description to the original macroblock than the prediction error. The set of motion 
vectors, 

15 

20 is used to locate the predicted macroblocks using the address references in the Macroblock Pointer Table. 

[0046] For the equations in this specification, the b superscript represents information derived from the Bitstream 
Buffer 401 while the c superscript represents infomriation from the Compressed Frame Buffers 407. Thus, for example,^, 
can be represented bydt otd: , depending on whether the macroblocks to which;? relates are intra-coded or not (i.e. 
whether mbjntra= 1 for those macroblocks). 

25 [0047] Each macroblock is variable-length encoded and has a pointer reference to the start of the macroblock for 
easy reference in motion compensation. Reference herein is being made to a PAL picture (720 pixels x 576 lines) of 
D1 resolution. A macroblock defines a two dimensional region consisting of a 16 x 16 pixel array in the video image. 
Each picture thus requires a Macroblock Pointer Table with a maximum of 1620 entries(where 45 macroblocks per line 
• 36 macroblocks per column = 1620 macroblock entries). A possible Implementation with memory saving is to have 

30 a hierarchical pointer system implemented in the Macroblock Pointer Table. For example, the picture is divided into 
video segments, such that each video segment consists of five consecutive macroblocks. Each picture should have 
324 full segment pointers (1 620 macroblocks per picture / 5 macroblocks per video segment = 324 segments) and four 
incremental macroblock pointers per segment, where each incremental macroblock pointer points to a macroblock 
relative to the previous macroblock within that segment. Hence, for a 4:2:0 chroma sampling fonnat, the largest mac- 

35 roblock size would be 3072 bits (8x8 pixels per block * 6 blocks per macroblock * 8 bits per pixel = 3072 bits per 
macroblock) and the largest segment size would be 15,360 bits (3072 bits per macroblock * 5 macroblocks per video 
segment = 15,360 bits) and the largest picture size would be 4,976,640 bits (3072 bits per macroblock * 1620 mac- 
roblocks per picture = 4,976,640 bits). Assuming that the compression does not expand the original pixel data, a 12 
and 23 bit precision is defined for an incremental macroblock pointer and a full segment pointer respectively. As a 

40 result, a minimum of 23,004 bits (324 segments per picture * (23 bits per full segment pointer + 12 bits per incremental 
segment pointer * 4 incremental segmental pointers per segment) = 23,004 bits) is required to implement the Macrob- 
lock Pointer Table. Preferably, an 8x8 block pointer system that provides a finer resolution is not used since it requires 
an additional 9 bits per block (capped by the amount of uncompressed data per block) and amounts up to an additional 
72,900 bits (9 bits per block * 5 incremental block pointers per macroblock * 1620 macroblocks per picture = 72,900 

4S bits) of memory. 

[0048] By first identifying the segment, within which the macroblock. is located and each macroblock has a 
positional offset, n, It is possible to calculate the address of a predicted macroblock as follows: 



mb _ address ^„ = segment _ address -^2^^^- address _ inc^ 



where 

55 

mb^addresSfjj is the absolute macroblock address of macroblock m; 

segment_addresSf( is the full segment address of segment ^ and mb_addressjnci is the incremental macroblock 
address of a macroblock with a position offset / within a segment k, and where / is an integer ranging from 0 to n- 
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1 and n represents the number of macroblocks within a segment. 

[0049] In contrast to the pixel resolution determined from the motion compensation scheme in a nomial MPEG De- 
coder, macroblock resolution Is used In the present decoder. Inter-coded macroblock properties are derived using 
5 motion vectors on the macroblock grid or a two dimensional boundary. 

[0050] Figures 4A, 4B and 4C illustrate examples of a predicted macroblock on the macroblock grid. As shown in 
Figure 5A, the predicted macroblock has a maximal overlap of pixels in macroblock D. The estimated macroblock 
complexity.;., and estimated bit count of intra-DC coefficients for the inter-coded macroblock,^ , are defined as: 

10 



15 



where Q^- ^ correspond to the bit Count of AC coefficients, the quantizer scale and the bit count of DC coefficients 
respectively, as derived from a simple decoding of macroblock D from the Compressed Frame Buffers 407. As shown 
in Figure 5B, if there Is equal overlap on all macroblocks, any one of the four macroblocks will be used (for example 
20 macroblock D). As shown in Figure 5C, if there is overlap on only 2 macroblocks, either one of the 2 macroblocks is 
used (for example macroblock B). A similar logic is applied at picture boundary conditions. 
[0051] However in the case of multiple motion vectors, the average complexity, X/> and the average bit count of DC 
coefficients, d/, are taken over all motion vectors, and is represented by: 



25 



30 



0 



where and respectively correspond to the macroblock complexity and the bit count of DC coefficients derived, 
as defined above for the motion vector, from the set of motion vectors {mu}^ and where K\s the maximum number 
of motion vectors in the bitstream of a P-Picture according to the picture and motion prediction type, as tabulated in 
35 Table 1 below: 



TABLE 1 



40 



45 



Picture 


Prediction 


K 


Type 


Type 




Frame 


Frame 


1 




Field 


2 




Dual Prime 


4 


Field 


Field 


1 




16x8 


2 




Dual Prime 


2 



50 



[0052] A similar procedure is derived for skipped macroblocks in the P-Picture by setting K = 1 and the corresponding 
motion vector = 0. 

[0053] Referring again to Figure 3, the derived macroblock properties from the Compressed Frame Buffers 407 and 
the extracted macroblock properties from the Bitstream Parser 402 are accumulated at a macroblock level in Data 
Analyzer 403. These macroblock properties are as follows: 
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Z, 

y-0 



the accumulated complexity; 
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the accumulated bit count of AC coefficients from the BItstream 
[00541 Parser; 



the accumulated bit count of DC coefficients from the BItstream Parser; and 



the accumulated bit count of DC coefficients from the Compressed Frame Buffers. 

[0055] When the last macroblock N-1 of the picture has been reached, the picture complexity, picture DC coeffi- 

25 _i» 

dents bit count from the Bitstream Parser 402, [f^ and the picture DC coefficients bit count from the Compressed 
Frame Buffers 407, if are obtained as the result of accumulation. Other picture statistics are computed at the picture 

pic 

level by the equations below. 

[0056] The bit count of a picture's DC coefficients is calculated as follows: 



where Xpi^.type the estimated compression factor of the DC coefficients bit count in the previous picture of the same 
type as described below. 

[0057] The target bit count of a picture's AC coefficients Is calculated as follows: 



pic *^ pte 



Where 0 is the overhead size (in bits) of the compression Including the Macroblock Pointer Table, the 
^5 quantizer_scale_code (a 5 bit parameter from the MPEG-2 Specification) and the dctjype (a 1 bit parameter from the 

MPEG-2 Specification) for all macroblocks in the picture, and where 6^. is the target picture size. The quantizer scale 

is calculated from the quantizer_$cale_code and q^scalejype. The dctjtype represents either performing the discrete 
cosine transfonn in frame fonnat or field fomnat. 

[0058] During the second field time of an l-Picture or P-Plcture, the MPEG Decoder 404 decodes the MPEG video 
bitstream. During the full decoding process, the macroblock parameter dcLtype^^ and all the above mentioned mac- 
roblock parameters (other than the bit count of DC coefficients from the Bitstream Parser 402, d^) are extracted. 

dctjlype^^ indicates whether frame IDCT or field IDCT was perfomned during the decoding process and is useful for 

encoding (such as In the Embedded Encoder 406). The extraction of the above mentioned macroblock parameters 
(other than the bit count of DC coefficients and the dct-type) during the first pass encoding for each macroblock is 
preferably repeated in the full decoding process (I.e. the second decoding process) so as to save buffer space for 
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storing macroblock values, although it Is not necessary for It to be repeated. Macroblock complexity may be computed 
again as described above for the Rate Control circuit 405. Additional parameters, like dct^type and min^qscale, are 
preferably derived in the full decoding process. However, these two parameters may be derived in the BItstream Parser 
402 during the first pass decoding and stored for use during the full decoding process. min_qscale represents the 

5 minimum quantizer scale used and Is relevant for the Mismatch Control circuit 605 (described later In the text) for 
controlling the minimum quantizer scale to be used for embedded encoding. If the quantizer scale used is lower than 
the min_qscale, then there Is no value added because quantization with a parameter smaller than the original encoded 
stream in the BItstream Buffer 401 does not produce a better image quality. The bits saved in the process can be used 
for storing other macroblocks. 

10 [0059] For intra-coded macroblocks, 



da _ type^ = dc( _ type^ 
xnin^qscole- - 

[0060] For Inter-coded and skipped macroblocks, 

20 

datatype, - datatype*; 



min_qscale^ -mm_qscale'; 

25 

[0061] The dctjype^^ and min^qscale^, parameters correspond to the dct type and minimum quantizer scale param- 
eters respectively, and are derived from the Compressed Frame Buffers 407 using the set of motion vectors 

30 



35 The c superscript represents infomiation from the Compressed Frame Buffer 407. 
[0062] For one motion vector 



40 



45 



50 



55 



(with reference to Figure 5A), dct^typei is the dctjype of macroblock D and min_qscale, is the minimum quantizer 
scale of the 4 or less macroblocks of Interest. Both are derived as follows: 



dct _ type^ = dctjype ^ 
min _ qscale^ = min 



[0063] Similar derivations can be done where there is an equal overlap (as shown In Figures 4B and 4C) and for 

picture boundary conditions. 

[0064] In the case of K multiple motion vectors, 
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dct _type, 



default _ dct _ type 
major _dct _type 



Y^dct^typel^KIl 

)tr»0 



Otherwise 



10 



min^qscale, =niin {fninjjscalel}^^^ 



15 



20 



where cfc/.fype^ refers to the DCT type derived for the 1^^ motion vector, which is derived in a similar way to the case 

lor the one motion vector above; defauH^dctJype refers to a fixed dctjype value assigned for cases where equal 
numbers of motion vectors have the same dct_Jype; major_def_type refers to the confomnance to the majority of the 

dct_type that is derived by the set of motion vectors; and min^qscalef Is the minimum min_qscale derived from the 
same set of motion vectors; and min^qscate^^ refers to the minimum quantizer scale derived from the ^ motion vector, 
which is derived in a similar way to the case for the once motion vector above. 

[0065] In addition, referring to Figure 3. Data Analyzer 403 updates two parameters (i.e. the default_dct_type at\6 
DC compression factor) at the picture level in the first or second field time for use in the subsequent pictures. 



25 



default _dctjype - 



1 2] d^^^^V^i >NI^ 
0 otherwise 



50 where N is the number ofmacroblocks in a picture and default^dctjype is the majority of dctjype used for the mac- 
roblocks in the picture. 

[0066] The DC compression factor is calculated as follows: 

Where picjype Is an l-Picture or P-Picture, D^^ Is the bit count of picture DC coefficients from the Bitstream Parser 

402. and Cf^^^ Is the bit count of picture DC coefficients from the re-encoded macroblocks that were intra-coded in the 
pfc 

Bitstream Buffer 401 . This compression factor is calculated for different picture types and may be used in the subse- 
quent pictures of the same type. Typically, Xp^ can be initalized to 1 . 

[0067] With the picture and macroblock statistical infomnation , the Rate Control circuit 405 derives a suitable quantizer 
45 scale for encoding. Figure 6 shows a functional block diagram of the rate control scheme. A Macroblock Bit Allocation 
circuit 601 (shown as 305 in Figure 2) allocates the number of bits to be used for encoding the AC coefficients of the 
macroblock according to scaled macroblock complexity, which is defined as follows: 



X pic 



where is the target number of bits for encoding AC coefficients for the macroblock; 

55 

iyis the estimated complexity for the macroblock; 

ipte is the estimated complexity from current picture statistics; 
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sj^^ is the target bits for encoding all AC coefficients from current picture statistics; and 



is the proportional integral control adjustment for the picture level. 

[0068] The Qscale Prediction circuit 602 (shown as 306 in figure 2) then predicts the quantizer scale with the following 
equation: 



where 



ij is the predicted quantizer scale for the macrobtock; 
ij estimated complexity of the macroblock; 

is the target number of bits used for encoding AC coefficients for the fi^ macrobtock; and 



is the proportional Integral control adjustment for the macroblock level. 

[0069] To stabilize bit rate generation and Improve rate control accuracy, a two-tiercontrol closed loop is implemented. 

The error, e/= sj^ - s^, defined as the difference between the target and re-encoded AC coefficient bit count, is fed 

back to outer and inner Proportional Integral (PI) control circuits 603 and 604 for the macroblock and picture level, 
respectively. The inner (macroblock) PI controller 604 compensates for the inaccuracies of the Qscale Prediction circuit 
602 at the macroblock level and adds an error adjustment, calculated by 



4s to the predicted quantizer scale,;;/, where a = -0.0008 and p = -0.0005 are example constant values. These values are 
exemplary values only. The bounds of these values are related to the convergence of the control loop. 
[0070] On the other hand, the outer (picture) PI controller 603 is concerned with the general stability of the bit rate 
generation and ensures the convergence of the target picture bit count. It adds an error adjustment, calculated by 



if-O 



55 to the target AC coefficient bit count before the scaled macroblock bit allocation takes place. In this case, example 
values can be ti = 0 and y = 1 .5 since the Impact of local error feedback Is less than Integral error feedback for stability 
purposes. These values are exemplary values only. The bounds of these values are related to the convergence of the 
control loop. 
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[0071] Mismatch control is performed by a Mismatch Control circuit 605 and is performed after the quantizer scale 
prediction and error adjustments. The objective is to match the discrete set of quantizer scale values defined by 
q_scale_typear)6 quantizer_scale_code in the MPEG specifications. First the Mismatch Control circuit 605 performs 
saturation on the incoming quantizer scale,; to the range of 2 to 62 for linear quantization (i.e. where q_scale_type = 
5 0), and to the range of 1 to 112 for non-linear quantization (i.e. where q_scalejype = 1). Then the Mismatch Control 
circuit 605 rounds the saturated quantizer scale to a discrete quantizer scale value according to the linear and non- 
linear quantization table. 

[0072] In addition, the Mismatch Control circuit 605 compares the discrete quantizer scale to the m/rLqsca/e variable 
and forces the quantizer scale to equate with the min^qscale variable if the quantizer scale is larger than the value of 
10 min^qscale. This Is to ensure that the Embedded Encoder (shown as item 406 in Figure 3) will encode with a quantizer 
scale no finer or smaller than Its original quantizer scale as picture quality does not improve and encoding bits can be 
saved. 

[0073] However for {-Pictures, if the estimated bit count of picture AC coefficients, calculated by 

is less than the target bit count of picture AC coefficients, s^^, the original quantizer scale, is used instead. 
20 [0074] In an alternative embodiment, a slight adaptation involving Xy (or the normalized macroblock complexity) can 
be used instead of the macroblock complexity, t/. The reason is that high complexity macroblocks have image details 
that are not compromised subjectively by using a coarser quantization step size, whereas low complexity macroblocks 
with smooth and homogenous areas that are more sensitive to the quantization step size and thus requires further 
quantization. 

25 [0075] Nomnalized complexity is defined as follows: 

30 ' 

where % is the average macroblock complexity of the previous re-encoded picture. Nomrialized complexity reduces the 
variation of macroblock complexity, raises the quantizer scale for higher complexity macroblocks and lowers the quan- 
tizer scale for low complexity macroblocks. The replacement of macroblock complexity, »y, by the nomrialised complexity, 
35 In all above steps concerning rate control achieves the advantage of having better subjective quality and higher rate 
control accuracy. 

[0076] The Embedded Encoder (shown as item 406 in Figure 3) re-encodes all anchor frames (namely l-frames and 
P-frames) and all the components of the Embedded Encoder are shown in Figure 7. A Discrete Cosine Transform 
(DCT) circuit 701 perfomns a two dimensional 8x8 DCT on the pixel output of the decoded video bitstream from the 

40 MPEG Decoder (shown as item 404 in Figure 3). For a frame picture, a frame DCT is performed if the dctjype is 
determined to be 0 In the Data Analyzer (shown as item 403 in Figure 3), otherwise a field DCT is perfomned. The 
resultant DCT coefficients are quantized by a Quantizer (Q) circuit 702 using the 8x8 intra-quantizer matrix from the 
Data Analyzer 403 and the derived quantizer scale from the Rate Control circuit (shown as item 405 in Figure 3). The 
8x8 quantized coefficients are then re-an-anged in a zig-zag manner by a zig-zag (ZZ) circuit 703 according to the 

45 predetermined altemate_scan format parameter from the Data Analyzer 403. A Variable-Length Coder (VLC) circuit 
704 subsequently variable-length encodes the one dimensional data in the chosen /nfra_Wc_formaf determined from 
the Data Analyzer 403 using run-length encoding and the Huffman table defined in the MPEG-2 specifications and 
also differential encoding of the luminance DC coefficients within the macroblocks. At the same time, the number of 
bits generated by the VLC circuit 704 is tracked and fed back to the Rate Control circuit 405. 

50 [0077] The VLC circuit 704 also ensures minimal expansion of video data on a 8x8 block basis by limiting the max- 
imum number of bits for an 8x8 block to 682 bits, or by dropping the last few coefficients if the limit is exceeded. The 
reason for this is that the macroblock pointer has a fixed precision of 12 bits and any expansion of data is Inefficient 
on memory savings. Furthemiore, limiting the bit count on the expanded data rate does not have an adverse effect on 
the picture quality. The entire encoding process is similar to the MPEG encoder except for the absence of the motion 

S5 estimator, the decoding loop, motion vectors and MPEG conforming bitstreams. The components in Embedded En- 
coder 406 can be built from standard components in an MPEG Encoder 

[0078] Referring again to Figure 3. as the bitstreams from the Embedded Encoder 406 are generated , they are stored 
into the Compressed Frame Buffers 407 and the Macroblock Pointer Table Is updated with the new starting address 
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of each macroblock. In the decoding of B-Plctures. anchors frames are required to provide the forward and backward 
prediction, giving rise to the need of at least two compressed anchor frames (1 -frames or P-frames stored as l-Pictures) 
In the Compressed Frame Buffers 407. For example, with a 1 0: 1 compression scheme, the capacity of the Compressed 
Frame Buffer 407 will be approximately equal to 1 megabits (calculated by 2 frames * 0.1 being the compression factor 

5 * 4,976,640 bits per frame = 1 megabits). 

[0079] Memory space may be shared between the Video Bitstream Buffer 401 , the Compressed Frame Buffers 407, 
the Still Picture Buffer (not shown) and the On-screen Display Graphics Buffer (not shown). The Still Picture Buffer 
stores a compressed still picture from the bitstream that may be decoded as a background picture during run time. 
The On-screen Display Graphics Buffer stores graphics Including texts and logos that are overlaid on screen for special 

10 features, for example the channel menu. The size of the Video Bitstream Buffer 401 varies with the video bitrate, with 
an upper limit determined by the maximum bit rate of the video bitstream. The size of the Compressed Frame Buffers 
may therefore be changed for every video bitstream or for the decoding of a particular bitstream according to application 
needs and memory availability. The number of bits for a picture may also be dynamically allocated for maximal picture 
quality. 

15 [0080] In an alternative embodiment, the present system may be extended to also perfomn lower resolution picture 
decoding. Figure 8 shows a modified architecture of the detector to support lower resolution picture or zoom-out picture 
decoding. The architecture in Figures involves the inclusion of a Decimation Filter 811 and an Interpolation Filter 812. 
All other components in Figure 8 have the same functionality as the correspondingly numbered components described 
in Figure 3. The Decimation Filter Circuit 811 perfomns a horizontal spatial decimation of the display pixels to the 

20 required resolution using a digital decimation filter, for example a 7 tap filter with coefficients [-29, 0, 88, 138, 88, 0, - 
29] for a 2:1 decimation, before it is encoded by Embedded Encoder 406. The horizontally down-scaled version of the 
picture Is stored as variable-length encoded bitstreams in the Compressed Frame Buffers 407 and Is decoded when 
necessary for display by the Display Decoder 408. 

[0081] An Interpolation Filter 812 is needed to perfomn a horizontal spatial interpolation to achieve full D1 resolution 
2s motion compensation using a digital interpolation filter, for example a 7 tap filter with coefficients [-12, 0, 140, 258, 140, 
0, -12] for a 1 :2 interpolation. The two filters can be designed jointly and filters for luminance and chrominance com- 
ponents can be customized to maximize picture quality. These filters are standard components of a video pre-processor 
subsystem in video related application systems. 

[0082] Besides using the above filters, the Data Analyzer 403 may be adapted to support zoom-out modes where 
30 the D1 horizontal resolution is an integer multiple, 6, of a lower horizontal resolution picture. The adaptation includes 
additional analysis being performed once for every 9 macroblocks decoded to compute con'esponding parameters of 
the to-bo-dlsplayed macroblock. Macroblock complexity and quantizer scale are averaged for 6 macroblocks, the min- 
imum min^qscale is selected among 6 macroblocks and the majority dctjype for 8 macroblocks Is confonned to. 

[0083] The bit counts of DC and AC coefficients (for l-Pictures), and J* respectively, are accumulated for every 

35 I 

decoded macroblock as described in the earlier text. The bit count of the respective DC and AC coefficient In a picture 
(for t-Pictures) are calculated based on the to-be-displayed macroblock, as follows: 



[0084] It is obvious to those skilled in the art that special considerations may be made for the picture boundary in 
the case where the width or height of the decimated picture is not an integer multiple of macroblocks. 
50 [0085] To reduce speed complexity issues regarding the B-on-the-Fly decoding of B-Plctures for low resolution dis- 
play, two extra picture buffers may be Included as part of the system to temporarily store B-Pictures. As such, B-pictures 
may be compressed similar to the anchor frame pictures as described above. The size of the Compressed Frame 
Buffers 407 is not compromised as tower resolution pictures produce acceptable picture quality at a smaller target 
picture bit count. 

55 [0086] Further simplification may be done to reduce decoding from a two-pass to a single-pass decoding process 
for B-Pictures. As such, the computation of the Data Analyzer 403 during the first field of decoding is eliminated and 
the macroblock bit allocation scheme in the Rate Control Circuit 405 Is adapted to allocate a constant or averaged 
target bit count of AC coefficients, defined by: 
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where A/ is the number of macroblocks In the low resolution picture, and the bit count of picture DC coefficients (and 
hence the target bit count of picture AC coefficients) may be estimated from the previous picture of the same type. 
[0087] The term "circuit" as used herein with reference to functional components is Intended to include any applicable 
hardware componentry which can accomplish the appropriate function and may include processor chips and ASICs 
as well as basic electronic logic components. The "circuits" may also be implemented as modules executed In software 
or a combination of software and hardware 



Claims 

1 . A method of processing video frame data, Including the steps of: 

(a) receiving a video frame; 

(b) partially decoding the video frame; 

(c) fully decoding the video frame to produce macroblocks; 

(d) determining video data parameters from the partially decoded video frame or both the partially and fully 
decoded video frame; 

(e) encoding the macroblocks based on the detenmined video data parameters to provide a compressed video 
frame for subsequent display. 

2. A method according to claim 1 , wherein the step of partially decoding the video frame includes variable length 
decoding of an input bitstream of the video frame data. 

3. A method according to claim 2, wherein the step of partially decoding the video frame includes variable length 
decoding of the input bitstream for an intra-coded video frame and variable length decoding of a compressed 
anchor video frame bitstream for inter-coded video frames. 

4. A method according to claim 3, wherein the said variable length decoding of a compressed anchor frame bitstream 
includes macroblock address decoding using a macroblock pointer table and the step (d) of detennining includes 
detemilning macroblock-level and picture-levet video data parameters. 

5. A method according to claim 4, wherein said macroblock pointer table is organised into video segments, each 
comprising n macroblocks with a full length segment pointer and n-1 incremental segment pointers, where n is an 
integer. 

6. A method according to claim 4, wherein said macroblock address decoding is perfomned according to: 

/-I 

mb _ address „ = segment _ address., + ^ _ address _ mc, 

where mb_address„ is the absolute macroblock address of macroblock m\ segment_addre$Sf( Is the full segment 
address of segment Ic, and mb_addressjnci is the incremental macroblock address of a macroblock with a position 
offset / within a segment /r, and where / Is an integer ranging from 0 to ^^1 and n represents the number of mac- 
roblocks within a segment. 

7. A method according to claim 1 , wherein step (d) includes one or more of: 

(i) determining macroblock-level and picture-level video data parameters: 

(ii) determining macroblock complexity in the video frame; and 
(Hi) detennining picture statistics of the video frame. 

8. A method according to claim 7, wherein said macroblock-level video data parameters of step (i) include one or 
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more of: an intra coding flag, a bit count of the AC coefficients, a bit count of the DC coefficients, a quantizer scale, 
motion vectors and discrete cosine transfonn (DOT) type. 

9. A method according to claim 7, wherein said picture-level video data parameters of step (i) include one or more 
of: quantizer scale type, Intra quantization matrix, Intra DC precision, alternate scan fonmat and intra variable length 
coding format. 

10. A method according to claim 7, wherein step (d) includes estimating the set of macroblock parameters from com- 
pressed anchor frames using a decoded motion vector. 

11 . A method according to claim 1 0, wherein the step of estimating macroblock parameters from compressed anchor 
frames for inter-coded macroblocks includes one or more of: 

(iv) detennining macroblock parameters Including a bit count of AC coefficients, a bit count of DC coefficients 
IS and a quantizer scale; 

(v) detenmlning a discrete cosine transfonn (DCT) type; and 

(vi) detemnlning a minimum quantizer scale. 

1 2. A method according to claim 1 1 , wherein step (iv) Is based on the maximal overiap principle where the macroblock 
20 assumes the characteristics of the compensated macroblock that is maximally overiapped. 

13. A method according to claim 11 , wherein said determined macroblock DCT type Is based on a majority of the DCT 
types used in motion compensated macroblocks for inter-coded macroblocks. 

25 14. A method according to claim 11, wherein said minimum quantizer scale Is the minimum quantizer scale used 
among motion compensated macroblocks for inter-coded macroblocks. 

1 5. A method according to claim 7, wherein said macroblock complexity is determined as the product of the macroblock 
quantizer scale and the bit count of the AC coefficients of the macroblock. 

30 

16. A method according to claim 7, wherein said macroblock complexity is estimated from the product of an estimated 
. quantizer scale and an estimated bit count of AC coefficients for inter-coded macroblocks. 

17. A method according to claim 7, wherein said picture characteristics include one or more of default DCT type, DC 
35 compression factor, picture complexity, the picture bit count of AC coefficients and the picture bit count of DC 

coefficients. 

18. A method according to claim 1 7, wherein the default DCT type is a fixed parameter value assigned where equal 
numbers of motion vectors have DCT type. 

40 

19. A method according to claim 17, wherein the DC compression factor is defined as the picture bit count of DC 
coefficients of the received video frame divided by the picture bit count of the DC coefficients from re-encoded 
macroblocks. 

45 20. A method according to claim 17, wherein said picture complexity is determined as the sum of the macroblock 
complexities of macroblocks in the video frame or the sum of the estimated macroblock complexities of macroblocks 
In the video frame. 

21 . A method according to claim 1 7, wherein said picture bit count of AC coefficients is determined as the sum of the 
50 macroblock bit counts of AC coefficients or the sum of the estimated macroblock bit counts of AC coefficients. 

22. A method according to claim 17, wherein said picture bit count of DC coefficients is detennined as the summed 
accumulation of the bit counts of DC coefficients from the intra-coded macroblocks multiplied by the DC compres- 
sion factor and accumulation of estimated bit count of DC coefficients from the Inter-coded macroblocks. 



55 



23. A method according to claim 11, wherein the step of determining video parameters from the partially decoded 
video frame includes step (iv) and the step of detennining video parameters from the fully decoded video frame 
Include steps (iv) to (vi), excepting detennining said bit count of DC coefficient. 
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24. A method according to claim 11, wherein the step of determining video parameters from the partially decoded 
video frame includes step (iv) and the step of determining video parameters from the fully decoded video frame 
include steps (v) to (vi). 

5 25. A method according to claim 11 , wherein the step of detemiining video parameters from the partially decoded 
video frame includes steps (Iv) to (vi). 

26. A method according to claim 1 , wherein the step (e) of encoding the macroblocks Includes: 

10 (I) allocating target encoding bits for each macroblock; 

(il) predicting a quantizer scale for each macroblock; 

(ili) implementing control loop feedback; and 

(iv) encoding the macroblock based on said video data parameters. 

'5 27. A method according to claim 26, wherein the allocation of target macroblock encoding bits includes scaling of 
macroblock complexity relative to picture complexity with a feed back control adjustment, as described by the 
equation: 



20 




25 

where Is the target number of bits for encoding AC coefficients for the fi^ macroblock;; , is the estimated complexity 

for the fi^ macroblock; jp^^ is the estimated complexity from current picture statistics; is the target bits for 

pic 

encoding all AC coefficients from current picture statistics; and is a proportional integral control adjustment for 
the picture level. 

28. A method according to claim 26, wherein predicting the quantizer scale includes an approximation of the rate 
quantization with a feedback control adjustment, as described by the equation: 

35 
40 

wherei , is the predicted quantizer scale for the macroblock; estimated complexity of the macroblock; is 

the target number of bits used for encoding AC coefficients for the macroblock; and ^^fs ^ proportional integral 
45 control adjustment for the macroblock level. 

29. A method according to claim 27, wherein said predicted quantizer scale is constrained to be above a minimum 
quantizer scale. 

50 30. A method according to claim 26, wherein said control loop feedback includes a two tier closed control loop with 
an inner loop controlling quantizer prediction accuracy and an outer loop compensating for bit rate accuracy. 

31. A method according to claim 28, wherein said control loop feedback is based on the error between the actual and 
target encoding bits and provides proportional integral or proportional control adjustments by applying the equation 

55 
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/-I 

5 

to the steps of the macroblock bit allocation and quanti2er scale prediction, where a and p are constants less than 
zero and e,- Is the error of the macroblocks, defined as the difference between the target and re-encoded AC 
coefficient bit counts. 

^0 32. A method according to claim 27, wherein ^p/^ is defined by 



/-I 

15 ' k^O 

where ti and yare constants and e, Is the error of the i^^ macroblocks, defined as the difference between the target 
and re-encoded AC coefficient bit counts. 

20 33. A method according to claim 26, wherein said encoding step (Iv) comprises the process of forward OCT transfor- 
mation, quantization and variable length coding based on said video data parameters, which includes the quantizer 
scale and a subset of one or more of: DCT type, quantizer scale type, intra quantization matrix, intra DC precision, 
alternate scale and intra variable length coding fomiat. 

25 34. A video decoding system having means for executing the method of any one of claims 1 to 33. 

35. A video decoder including: 

(a) a bitstream parser for receiving a video frame; 
30 (b) an embedded decoder for partially decoding the video frame and fully decoding the video frame to produce 

macroblocks; 

(c) a data analyzer for detemnining video data parameters from the partially decoded video frame or both the 
partially and fully decoded video frame; . 

(d) an embedded encoder for encoding the macroblocks based on the detemiined video data. 

35 

36. A video decoder system having means for supporting lower resolution picture decoding including: 

a video decoder as claimed In claim 35; 

a decimation filter module for receiving the decoded macroblocks and performing a horizontal spatial declma- 
^0 tion of the display pixels to the required resolution using a digital decimation filter; and 

an Interpolation filter module for receiving the decoded reference macroblocks and perfonning a horizontal 
spatial interpolation to achieve a full D1 resolution motion compensation using a digital interpolation filter. 

37. A decoder according to claim 36, wherein the data analyzer is adapted to perfomfi averaging operations for the 
lower resolution picture decoding. 
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